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ABSTRACT 

In order to investigate discrepancies between recent published estimates of the the HI mass function 
(HIMF), we explore the impact of distance uncertainties on the derivation of the faint end slope 
of mass and luminosity functions of galaxies in the local volume by deriving HIMFs from mock HI 
surveys. We consider various survey geometries and depths and compare the HIMFs measured when 
using "real" distances, distances derived by assuming pure Hubble flow and distances assigned from 
parametric models of the local velocity field. The effect is variable and dependent on the exact survey 
geometry, but can easily lead to incorrect estimates of the HIMF, particularly at the low mass end. We 
show that at least part of the discrepancies among recent derivations of the HIMF can be accounted 
for by the use of different methods to assign distances. We conclude that a better understanding of 
the local velocity field will be necessary for accurate determinations of the local galaxy luminosity and 
mass functions. 

Subject headings: galaxies: luminosity function, mass function — galaxies: distances and redshifts — 
radio lines: galaxies 
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1. INTRODUCTION 

One of the perplexing discrepancies between obser- 
vations and CDM theory revolves around the "missing 
satellite" problem. Numerical simulations in ACDM pre- 
dict a value for the logarithmic slope of the faint end of 
the halo mass function, a, which is close to the value 
of —1.8 that arises analytically from the Press-Schechter 
formalism (Press & Schechter 1974), but most recent de- 
terminations of the optical luminosity function (LF) in 
the local volume yield values of a that are significantly 
flatter. Complementary to the results on the optical 
LF, several determinations of the local HI mass func- 
tion (HIMF) have likewise yielded relatively flat faint-end 
slopes. The over-prediction of the number of low mass 
objects relative to those actually observed is considered 
one of the last remaining stumbling blocks for ACDM, 
and points towards a new understanding of the baryon 
physics of galaxy formation. It is obviously desirable 
that all of the uncertainties in the observed luminosity 
and mass functions be well understood. 

To derive a luminosity (or mass) function from observa- 
tional data, the sample selection effects must be consid- 
ered carefully. In most surveys, the lowest luminosity (or 
mass) objects are only visible nearby, and their contribu- 
tion must be weighted accordingly. Fractional distance 
errors can be large for relatively nearby objects, and can 
thus have a very strong impact on determinations of the 
faint end characteristics of luminosity (or mass) func- 
tions. When those functions are evaluated using data 
obtained in clusters, the principal source of uncertainty 
lies in the assessment of membership; assuming that un- 
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certainty is to first order independent on luminosity (or 
mass), it can have a small impact on the estimate of LF 
slopes. Determinations of the HIMF, however, have to 
rely on relatively shallow, wide area surveys to detect 
low HI mass objects over a sufficient volume, and are 
thus most susceptible to uncertainties associated with 
distance errors in the nearby population. 

Primary distances (eg. cepheids, the tip of the red gi- 
ant branch or surface brightness ffuctuations, the best of 
which give distances to ± 10%) are available for ^ 600 
galaxies, most within ~ 30 Mpc. Secondary distances 
(which rely on the above methods for calibration) carry 
larger errors (~ 15-30%) but do account for peculiar ve- 
locities for some 5000 objects. For all other galaxies, the 
only indication of distance is their redshift. Outside of 
cz — 6000 km s^^, pure Hubble ffow works well, since a 
typical peculiar velocity of a few hundred km s~^ creates 
an uncertainty on the redshift distance of < 10%. For 
closer galaxies, however, the relative error is much larger, 
and, for the nearest objects, whose recessional velocities 
are dominated by peculiar motions, the error can be a 
factor of two or more. Furthermore, peculiar velocities 
are not random but exhibit coherent ffows towards local 
over-densities. In some directions, the redshift-distances 
of all (nearby) galaxies will be systematically under- or 
over- estimated if peculiar velocities are ignored. 

The effects of peculiar velocities on distance determi- 
nations can be mitigated by the use of parametric mod- 
els of the local velocity field. While clearly oversimpli- 
fications of the real flows, such models give a direction- 
dependent redshift-distance prescription. A recent ex- 
ample of such a model is the multi-attractor model of 
Tonry et al. (2000; hereafter TBOO: also see Wflhck & 
Batra 2001). 

In order to investigate the effects of distance uncertain- 
ties on local luminosity or mass functions, we consider 
here the derivation of the HIMF. Recent determinations 
of the local HIMF have yielded discrepant results on the 
abundance of low HI mass objects, namely: Zwaan et 
al. (1997; Z97), Rosenberg & Schneider (2002; RS02) and 
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TABLE 1 
Parameters in published HIMFs 



Reference 


Z97 


RS02 


Z03 


Survey 


AHISS 


ADBS 


HIPASS 


<;!>*[/if(,Mpc-3] 


0.0048 


0.0047 


0.0070 


log(M*/M©) + 2Iog?i70 


9.86 


9.94 


9.85 


a 


-1.20 


-1.53 


-1.30 


Survey area 


65 deg^ 


430 deg2 


20,600 deg2 


Aim for 108 Mq 


100 Mpc 


13 Mpc 


7 Mpc 


# galaxies 


61 


265 


1000 



Zwaan et al. (2003; Z03). The Z03 HIMF is based on the 
HI Parkes All Sky Survey (HIPASS) Bright Galaxy Cat- 
alogue (BGC), while the Z97 and RS02 HIMFs are both 
based on drift scan surveys conducted at Arecibo dur- 
ing the period of its recent upgrade (respectively called 
AHISS, the Arecibo HI Strip Survey, and ADBS, the 
Arecibo Dual Beam Survey). A summary of the results 
of those surveys is provided in Table 1, parameterized as 
Schechter functions, 
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where x = M/Mi,, M is the HI mass, and n is the total 
number density of galaxies. The faint end slopes of the 
HIMF derived by the three surveys vary between —1.20 
and —1.53, yielding extrapolations below Mhi = lO'' 
Mq that disagree by an order of magnitude, the RS02 
HIMF having the steeper slope. 

Previous studies of the impact of distance errors (from 
ignoring peculiar veloc;itics) on HIMFs have modeled the 
effect, often called the 'Eddington effect', by adding ran- 
dom Gaussian noise to the distances. Using this model, 
RS02 conclude that the effect is negligible in their sur- 
vey volume until velocity dispersions of 600 km s^^ are 
reached, while Z03 find that for an assumed velocity dis- 
persion of 50-100 km s~^ in the local universe {cz < 3000 
km s~^) the slope of the low mass end of the HIMF prob- 
ably steepens by Aa < 0.05. Since the peculiar veloc- 
ity field in the local universe is not random but rather 
dominated by in-fall onto the Virgo cluster, these mod- 
els could easily hide the main bias. In this paper, we 
investigate whether all or part of the discrepancy among 
the three HIMF determinations can be accounted for by 
the effects of distance uncertainties introduced by the 
different methods used by Z97, RS02 and Z03 to assign 
distances. 

Here, we construct mock catalogs designed to mimic 
the HI surveys conducted by Z97, RS02 and Z03 and 
then derive mock HIMFs under different distance as- 
sumptions. Ho = 70 km s^'^Mpc""'^ is assumed through- 
out, and the dependence on Ho is explicitly listed using 
hro = Ho/{7Q km s"^Mpc"^). 

2. SURVEY SIMULATIONS 

We first construct a mock catalog of HI rich objects in 
the local universe, which is then "observed" . In order to 
account for the known local variations in the 3D distri- 
bution of galaxies, a version of the density and velocity 
fields derived from the IRAS Point Source Catalog Red- 
shift Survey (PSC^) with a value (3 = fl^l^s/b = 0.5 is 



used (kindly provided by E. Branchini, see Branchini et 
al. 1999 for details). This map extends to a distance of 
120 Mpc with a grid spacing is 0.9375 Mpc in 
the inner 60 Mpc, and twice that in the outer shell. 
The map has been smoothed with a Gaussian filter of 
radius 3.2 Mpc. 

The volume is seeded with HI rich galaxies according 
to the density field, and masses are assigned from a given 
HIMF. The HI sizes of the galaxies and velocity widths 
are calculated according to empirical scaling relations 
derived from our own HI survey data and from Broeils 
(1992). Inclinations are randomized, realistic scatter is 
added to the scaling relations (as measured from the em- 
pirical fit) and a turbulent velocity of 10 km is added 
to construct the observed velocity width. 

The HI mass of a galaxy at a distance D (in Mpc), 
and with observed HI line flux can be approximated as 
Mhi/Mq ~ 2.356 X lO'^D^Sp^^i.Wm, where S^peak is the 
peak fiux in the HI line in Jy and Whi is its width in 
km s~^. The detectability of a HI rich galaxy depends 
on both the peak flux and the width of the HI line. If 
wc assume that smoothing of the spectrum is applied 
to match the width of the lines, up to a maximum of 
W^smo — 100 km s~^, the signal-to- noise ratio can be 
derived from the standard radiometer equation such that 



S/N 



6-06 (ifS^) (l^)V/3ii/MCBW/25kHz) 



{D^T,y./G) 



(2) 

where 7 = —3/4 for Whi < 100 km s ^, and 7 = — 1 for 
Wm > 100 km s~^, accounting for the effects of smooth- 
ing; f/s is the fraction of the source covered by the tele- 
scope beam; tint is the integration time; CBW is the 
channel bandwidth and Tgys/G describes the gain of the 
telescope. Specific details of the simulated observations 
are given in Section 3 below. 

The observed HI mass of a galaxy is 
Mobs = Mtruo(£'obs/-Dtruo)^, wherc Dobs is the dis- 
tance assigned to the galaxy and -Dtruc is its true 
distance. From the set of "detections" wc construct 
HIMFs by summing 1/Mim (the total volume within 
which a galaxy can be seen) weighted by the average den- 
sity in that volume (from the PSC2; density field) in bins 
of log(M////MQ). This method prevents over- or under- 
dense regions from giving an over- or under-estimate 
of number counts in certain bins. If -Dobs — -Dtruei 
we recover the input HIMF. We investigate below the 
effect that Dobs 7^ i'true lias on the recovered HIMF for 
various survey geometries and depths. 

3. RESULTS AND DISCUSSION 
3.1. Simulated HIPASS BGC 

First, we consider a mock catalog constructed to mimic 
the HIPASS BGC which contains the 1000 galaxies in the 
southern sky with the largest peak HI flux (see Z03). To 
remove the effects of cosmic variance an average obtained 
from 10 simulations is used. 

Most of the galaxies in the real BGC have; their dis- 
tances estimated using Bubble's Law, and yet are within 
cz = 3000 km s~^. We investigate what effect this as- 
sumption has (if any) on the derived HIMF. As noted in 
Table 1, the HIMF derived from the HIPASS BGC has a 
shallower low mass end slope than that derived from the 
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Fig. 1. — Difference between input and reconstructed HIMFs 
for a mock HIPASS BGC survey (shown are the average of 10 
simulations). The HIMF derived when "true" distances are used 
(circular points and dotted line), along with the HIMF constructed 
when multi-attractor model distances are used (triangular points 
and long-dashed line) reconstructs the input HIMF within the er- 
rors (here typical Poisson counting errors are shown only on the 
"true" distance points). When pure Hubble flow is assumed (square 
points and short dashed line), the low mass end of the HIMF is un- 
derestimated. The solid line shows the Z03 HIMF, which differs 
from the input by Aa = 0.1. 
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Fig. 2. — Difference between input and reconstructed HIMFs 
from simulated ALFA drift scans of Virgo (shown is the average of 
10 simulations; roughly 4000 galaxies are seen in each survey). As 
in Figure the input HIMF (that of Z03, but with a steepened to 
-1.4) has been subtracted, but note that here the y-scale has been 
extended down to accommodate the huge deviation when Hubble 
flow distances are used. Line and point types are the same as in 
Figure When the "true" distances are used the fit to the low 
mass slope of the HIMF is a = —1.37; the multi-attractor model 
distances give a = —1.36, while the Hubble flow distances give 
a = -1.28. 



ADBS in RS02. We will see if any part of this discrep- 
ancy can be accounted for by the use of Hubble's Law to 
assign distances by Z03. 

Figure ^ shows the difference between the input and 
reconstructed HIMFs for the input (or "true") distances, 
distances derived by assuming Hubble flow, and distances 
from the multi-attractor model of TBOO. Clearly, sig- 
nificant variations arise among the HIMFs constructed 
from the mock HIPASS BGC when different distance es- 
timators are used. The HIPASS survey volume does not 
contain the Virgo Cluster and therefore has a fairly quiet 
local velocity field; however, as can be seen from the fig- 
ure, the low mass slope of the HIMF will be underesti- 
mated if pure Hubble flow is assumed. In Figure ^ the 
input HIMF is the same as Z03 (see Table 1), except that 
the low mass slope has been steepened to a = —1.4. The 
HIMF recovered from this mock HIPASS BGC is close 
to that reported by Z03 when Hubble flow is assumed. 

We suggest that an adjustment of Aa ~ —0.1 should 
be applied to the Z03 low mass slope, making it a = 
— 1.4± 0.1. This correction makes the result consistent 
with the RS02 result of a = -1.53 ± 0.2 (where here 
the error on the RS02 result is estimated by scaling that 
reported for Z03 by ViVgai)- This change in low mass 
slope, while interesting in the light of comparisons with 
ACDM, has little effect in the total HI mass density im- 
plied by the surveys. The total mass density, dominated 
by Mi, galaxies, is already in agreement between the two 
surveys. 

3.2. Simulated Virgo cluster survey 

As the nearest rich cluster to us, Virgo represents the 
highest density region in the local Universe, and thus 
constitutes the obvious choice for the study of the HIMF 
in a high density environment. The northern hemisphere 
sky accessible to the Arecibo telescope covers both the 
Virgo cluster and comparably-far regions of lowest cos- 
mic density located roughly in the anti- Virgo direction. 



The arrival in 2004 of the Arecibo L-Band Feed Ar- 
ray (ALFA) will revolutionize mapping capability at the 
Arecibo telescope and will allow study of possible vari- 
ations in the HIMF in different density environments, 
already suggested by RS02 but with low statistical sig- 
nificance. Here, we investigate the potential of a pro- 
posed ALFA survey of an area 30° x 30° centered on 
the Virgo cluster (RA=11 - 13'\ DEC^O - 27°), assum- 
ing Tsys/G = 3 Jy, and with ii„t — 12s (drift scan) and 
CBW=25 kHz. The mean noise in such a survey would 
be ~ 4 mjy/beam (at our assumed maximum resolution 
of 5 km s~^). This can be compared to 20 mJy/beam 
for HIPASS and ^ 8.6 mJy/beam for the ADBS, see 
iBarnes et al. I 1)2001 !) and RS02; here both values have 
been scaled to a resolution of 5 km s~^. We also consider 
an equal area in the anti- Virgo region. (RA=23 — 1^, 
DEC=— 27 — 0°). This region of the sky is not accessible 
to Arecibo, but for the purposes of illustrating the effects 
of distance uncertainties on the construction of HIMFs, 
the combination of such a Virgo-anti- Virgo survey pro- 
vides a good comparison between the nearest regions of 
high and low cosmic density. 

In the Virgo region, we are able to reconstruct the 
input HIMF well when the "true" distances are used, 
and reasonably well with distances taken from the multi- 
attractor model of TBOO. However, if pure Hubble fiow 
is assumed, the number of galaxies at the low mass end 
of the HIMF is seriously underestimated (see Figure 
The lowest mass galaxies in our mock survey are seen 
only in the foreground of Virgo (the mass limit at 17 
Mpc, the approximate distance of the Virgo cluster, is 
~ 10^ M0 from Eqn 2 assuming Whi=100 km s~^ and 
CBW=25 kHz). If pure Hubble flow is assumed the vast 
majority of these foreground galaxies will have their dis- 
tances (and hence masses for a given HI flux) overesti- 
mated, since they are in-falling towards Virgo. 

The two recent Arecibo-based derivations of the HIMF 
(Z97 and RS02) include part of the Virgo cluster within 
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Fig. 3. — Difference between input and reconstructed HIMFs 
from simulated ALFA drift scans of the Anti- Virgo region (shown 
is the average of 10 simulations; roughly 3000 galaxies are seen in 
each survey) . Line and point types are the same as in Figures 
& 1^ Note that this region will not be accessible to ALFA, but 
is included here to illustrate the difference between comparable 
surveys of the nearest regions of high and low cosmic density. 



the overall survey area. The low mass end slopes derived 
from these surveys differ by Aa = 0.3 (with Z97 having 
the shallower slope). In RS02, the multi-attractor model 
of TBOO is used to assign distances, while in Z97 pure 
Hubble flow is assumed. However, the difference between 
the two HIMFs cannot be explained by their use of dif- 
ferent distance estimators. The Z97 survey was designed 
to be extremely sensitive but covers a very small area. 
In deeper surveys, we expect the effect described above 
diminishes. We have constructed a Z97-like survey (of 
similar depth and area), and, as expected, the HIMF can 
be reconstructed equally well in such a deep survey with 
all three distance estimates. We therefore must resort 
to the usual arguments of small-number statistics (not- 
ing that there were only 65 galaxies in the Z97 HIMF) 
and differing treatments of completeness to explain the 
discrepancy between the HIMFS of Z97 and RS02. 

In the Anti- Virgo region, the input HIMF can be re- 
constructed well using either "true" distances or by as- 
suming Hubble flow. Here the use of the multi-attractor 
model of TBOO causes the largest bias; acting to create 
an underestimate of the number of low mass HI galax- 
ies (see Figure 13). The multi-attractor model of TBOO 
is based on distances to ~ 300 elliptical galaxies. There 
are approximately 30 of these galaxies in the Virgo re- 
gion discussed here, but fewer than 5 in the anti- Virgo 
region. At the depth of these HI surveys, the anti- Virgo 
region is relatively under-dense, making it unavoidable 
that there will be few tracer galaxies in that direction. 
This undersampling has the effect of forcing the model 
for the velocity fleld to fit better towards Virgo at the 
expense of the fit in the anti- Virgo direction. The multi- 
attractor model of TBOO consistently over-estimates the 



distance for a given velocity in the anti- Virgo direction 
leading to the observed underestimate of the number of 
low mass galaxies (as discussed above). 

4. CONCLUSIONS 

We have used mock HI survey data to demonstrate 
that distance uncertainties can create serious biases in 
the low mass end of the HIMF; similar arguments could 
be applied to studies of the field galaxy optical LF. When 
the effects of the local peculiar velocity fleld are ignored, 
incorrect estimates of the HIMF will be derived. The 
exact nature and size of the bias will depend both on the 
survey geometry and its depth. 

In a survey constructed to simulate the HIPASS BGC, 
the low mass HIMF slope is underestimated if pecu- 
liar velocities are neglected. We suggest that the HIMF 
reported by Z03 suffers from this problem, which when 
corrected, would bring the Z03 and RS02 HIMFs into 
agreement within their errors. The adjusted value of the 
Z03 low mass slope is a — — 1.4 ± 0.1, which is to be 
compared with the RS02 value of a = — 1.5±0.2. This 
change does not affect the implied HI mass density since 
it is dominated by M^, galaxies. 

For a survey where low mass galaxies are seen only 
in the foreground of Virgo, the distances (and therefore 
masses) of the nearby low mass galaxies will be overes- 
timated and the low mass end of the HIMF seriously 
underestimated if in-fall onto Virgo is neglected. In 
deeper surveys, the effect becomes negligible, so that dif- 
ferences in the adopted distances alone cannot explain 
the discrepancy between the Z97 and RS02 HIMFs. 

A simulated survey in the anti- Virgo region points to 
the limitations of currently available models for the local 
velocity fleld. In such a survey, use of distances taken 
from a multi-attractor model for the local velocity fleld 
leads to an underestimate of the low mass HIMF slope. 
Currently available flow models are not well constrained 
in the anti- Virgo direction because of a lack of galax- 
ies with primary distances in the region. Velocity fleld 
models based on larger numbers of tracer galaxies will 
be required to solve this problem. 

The use of a good model for the local peculiar veloc- 
ity fleld is found to be essential to provide distances for 
nearby galaxies for which no redshift-independent dis- 
tances are available. Without such a model, it is difficult 
to derive an unbiased HIMF, and in particular, the low 
mass end may be in error. Disentangling the effects of 
environment and evolution of the HIMF will not be possi- 
ble unless the local flow model is well understood. Other 
applications where distances to galaxies are needed will 
also beneflt from a better understanding of local flows. 
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